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Abstract 



Suppose we have k matrices of size n x n. We are given an oracle that knows all 
the entries of k matrices, that is, we can query the oracle an entry of the l-th 
matrix. The goal is to test if each pair of k matrices commute with each other or not 
with as few queries to the oracle as possible. In order to solve this problem, we use a 
theorem of Mario Szegedy [Sze04b, Sze04a] that relates a hitting time of a classical 
random walk to that of a quantum walk. We also take a look at another method of 
quantum walk by Andris Ambainis [Amb04a]. We apply both walks into triangle 
finding problem [MSS05] and matrix verification problem [BS05] to compare the 
powers of the two different walks. We also present Ambainis's method of lower 
bounding technique [AmbOO] to obtain a lower bound for this problem. It turns 
out Szegedy's algorithm can be generalized to solve similar problems. Therefore 
we use Szegedy's theorem to analyze the problem of matrix set commutativity. We 
give an 0(k^ 5 n 9 ^ 5 ) algorithm as well as a lower bound of f2(/c 1//2 n). We generalize 
the technique used in coming up with the upper bound to solve a broader range of 
similar problems. This is probably the first problem to be studied on the quantum 
query complexity using quantum walks that involves more than one parameter, 
here, k and n. 
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Chapter 1 
Introduction 



1.1 The Model, Motivation, and the Main Re- 
sults 

Suppose we are given a set X of size n and we want to test if the set satisfies a given 
property. We are also given an oracle that computes f(i) for some index % in the 
set. For example, in element distinctness [Amb04a], X is a set of integer variables, 
{xi,x 2 , ■ ■ ■ ,x n } and the property to test is whether there are two different indices 
i and j such that Xi = Xj. In order to decide if X satisfies the property, we query 
the oracle for values f(i) = x^ at various indices %. In general, we are interested 
in minimizing the classical or quantum query complexity, the number of queries 
a classical or quantum algorithm make to the oracle. This notion will be defined 
formally in Section 1.2.5. 

We are interested in studying classical and quantum query complexities because 
an oracle sometimes gives a separation between them. For example, de Beaudrap, 
Cleve and Watrous showed one problem where we need an exponentially many 
queries in the bounded error classical case, but only a single query is needed in 
the quantum case [dBCW]. Another occasion to study a query complexity is when 
obtaining a time complexity is hard. In such a case, the number of queries we make 
gives a lower bound for the time complexity. In fact, currently there is no lower 
bound method for quantum time complexity that gives super-linear bounding, and 
by studying quantum query complexity, we get lower bounds heuristic on quantum 
time complexity. 

One of the powers of quantum computation comes from the fact that we can 
query in superposition. That is, if we are given a set of n elements from 1 to n 
denoted [n], we can query an oracle in parallel once to obtain a superposition of 



1 



CHAPTER 1. INTRODUCTION 



2 



/(l) through f{n). However, as we will see in Section 1.2.2, we can in a sense only 
learn one of the /(i)'s from such a query. The real power of quantum computation 
comes from interference. That is, the information in the states, e.g., /(i)'s, can 
be combined by means of unitary quantum gates in a non-trivial way, and we can 
extract a global property of the inputs. For example, in Deutsch's algorithm, given 
two input bits indexed by and 1, we cannot obtain both /(0) and /(l) in one 
oracle query. However, by making a suitable quantum query, we can obtain a global 
property, /(0) © f(l) [Deu85]. This interference is also used for quantum search 
in an unstructured database, in an algorithm due to Grover [Gro98], to extract a 
global property, i.e., if the set we are given contains an element we are looking for. 

It turns out we can generalize Grover's search to test if a set we have satisfies 
a given property using a quantum version of a random walk, called a quantum 
walk. Using a quantum walk, for example, element distinctness can be solved 
in 0(n 2//3 ) [Amb04a] queries with a matching lower bound of Vt(n 2 ^) [AS04]. A 
quantum walk was first studied on the line, both discrete [ABN + 01] and contin- 
uous [FG98], analogous to classical discrete and continuous random walks, except 
that a quantum discrete walk uses a coin to decide which point to move to next, 
whereas a quantum continuous walk does not. The discrete quantum walk on the 
line showed that the probability distribution after certain number of steps of quan- 
tum walk is different from that of the classical probability distribution [ABN + 01]. 
The continuous quantum walk was then applied to a graph that gave an exponential 
speed up in a hitting time as compared to the classical counterpart [CFG02]. The 
discrete quantum walk on the line was also extended to general graphs [AAKV01] 
and later applied to a search on a hypercube [SKW03]. Both discrete [AKR05] 
and continuous [CG04] walks were applied to search an item on a grid. Ambai- 
nis [Amb04a] used a discrete quantum walk to solve element distinctness. This is 
generalized in [MSS05] to find a three clique in a graph (triangle finding). Szegedy 
proposed a different quantization of a classical Markov chain in [Sze04b, Sze04a]. 
He showed that there is a quadratic speedup for the hitting time of his quantization 
of classical walk. Szegedy's quantization was applied in [BS05] to verify a prod- 
uct of two matrices (matrix verification). For more details in the development of 
quantum walk based algorithms, see [Amb04b]. 

The goal of this essay is to investigate the query complexity of testing the 
commutativity of k matrices of size nx n. This essay is probably the first to study 
quantum query complexity that involves two variables, k, the number of matrices 
in the set and n, their dimension. We show that there are three upper bounds for 
this problem, 0(kn 5 ^ 3 ), 0(k 2 ^n 2 ) and 0(/c 4 / 5 n 9//5 ), depending on the relationships 
between the variables k and n. We also show a lower bound of f2(A4/ 2 n). 

The organization of the essay is as follows. We first introduce the mathematical 
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background necessary to understand our quantum algorithms in Section 1.2. Then 
we take a look at the details of Szegedy-Walk in Section 2.1 and Ambainis-Walk 
in Section 2.2. We use these two walks to analyze triangle finding problem in 
Section 2.3 to see a case where Amabinis-Walk performs better. In Section 2.4.1, 
we take a look at a quantum adversary method [AmbOO] to obtain a lower bound 
for our problems. We shift our focus to matrices next and in Section 2.5, we study 
matrix verification problem. In Chapter 3, we finally study the problem of testing 
the commutativity of k matrices of size n x n. We first take a look at a case 
where k = 2 by using a modification of matrix verification in Section 3.1. Next 
we study four different algorithms for a general k in Section 3.2. This problem is 
generalized in Section 3.3. Finally we give a summary and directions for future 
work in Chapter 4. 



In this section, we will go over the mathematical background necessary to follow 
the algorithms in this essay. Beyond the content in this section, [NCOO] is a good 
reference in general introductory material in quantum computation. 

1.2.1 Space and qubit 

Classically, information is encoded in a binary string using a sequence of bits and 
1. Quantumly, information is encoded in a finite-dimensional complex vector space, 
endowed with the standard inner product, a Hilbert space using qubits. A qubit 
may exist in states |0) and |1), which are basis vectors for a two-dimensional space. 



or in any linear combination of these basis states with unit norm. We call the two 
vectors |0) and |1) computational basis for the two-dimensional Hilbert space since 
they correspond to the conventional bit representation of information. There are 
other pairs of basis states that span the two-dimensional Hilbert space but we focus 
on the computational basis. The state of a sequence of n qubits is a unit vector in 
the n-fold tensor product space C 2 ® C 2 ® . . . ® C 2 . This 2 n -dimensional space is 
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spanned by tensor products of states |0), |1). This is the computational basis for 
the n-qubit memory. The tensor product of two vectors |0) and is denoted as 
|0) <S> \ip). When these are computational basis vectors given by bit strings x,y, we 
may abbreviate the state \x) <g> \y) by \x,y) or simply \xy). The latter two make 
sense when x and y are bit-strings. Using a standard vector notation, a tensor 
product of two vectors is obtained by multiplying each entry in the left vector with 
the right vector. 




This extends in the natural way to tensor products of higher-dimensional vectors. 
The dual of the vector \i) is denoted by which is a row vector obtained by taking 
a conjugate transpose of For |0) this is just a row vector (1 0). 

1.2.2 Superposition and Measurement 

A Hilbert space of dimension n is spanned by n orthonormal vectors, and we can 
express a state in the space as a linear combination of these basis states. For a 
two-dimensional Hilbert space with the basis states |0) and |1), any state |0i) can 
be expressed as 

V l«o| + Fir 

where ctj is the amplitude of Similarly, a multiple qubit state is also expressed 
as a linear combination of its basis states. For example, a two qubit state Ifa) can 
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be expressed as a linear combination of four computational basis states, 

I0 2 ) = 7= = , = ======= («oo|00) + a O i|01) + « 10 |10) + a n |ll» . 

Vlaool + Foil + Fiol + Fur 

Definition 1 (Measurement [NCOO]) Given a set of n basis states {\m,j)}, a 
measurement in a basis \m) of a state \<p n ) = oci\mi) + . . . + a n \m n ) is a projection 
of \4> n ) onto one of the basis states by applying projective operators {\mi)(mi\} to 
\4> n ) . The superposition collapses to one of the basis states and the probability of 
obtaining \rrii) is (<f> n \ (\mi)(m,i\) \<f> n ) = \ai\ 2 . The state after measurement is then, 

\mi)(mi\4,„) 



The implication above is that before measuring \<f) n ), the state is in superposition 
of its basis states, but measuring collapses the superposition and gives only one of 
the basis states as an outcome with the probability according to the amplitude of 
the basis states in \<f> n ). Since the probabilities must sum up to one, this means 
that the sum of the squares of the amplitudes must also sum up to one, 



Also note that we normalize the collapsed state resulting from the measurement 
so that the squares of the amplitudes in this new state also sums up to one. 

For a multiple qubit system, we can also measure a small set of qubits only and 
leave the rest alone. A measurement in the computational basis of the first qubit 
collapses the first qubit into one outcome of the measurement, the remaining state 
is unchanged. Formally, the state is projected onto a subspace consistent to the 
measurement outcome. For example, if we have 



n 




i=l 



i 



(|00) + |01> + |10> + |11», 



V2 



measuring the first qubit gives with probability \ and 1 with probability \. On 
outcome 0, the new state is 



|^) = 4(|oo> + |oi», 
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-e- 



Figure 1.1: Diagrammatic Representations of X, H, and control-NOT respectively, 
and on outcome 1, the new state is 

|^> = ^(|10> + |11». 

1.2.3 Operators and Quantum Gates 

A quantum gate is a matrix that acts on the state vectors. In order for a matrix 
to be a legal (physically realizable) operator, it must be unitary, that is WU = I, 
where W is the conjugate transpose of a gate U. Some gates that are used for the 
construction of the algorithms in this essay are X, Hadamard H, and control-NOT 
gates. 

*-(!;)•*-£(! -0' c - not 

The effect of X on computational basis is a logical NOT operation, X\0) = |1) 
and X\l) = |0). A Hadamard, H transforms |0) into a uniform superposition of 
|0) and |1) i.e., and |1) into ■ Applying H for each of n qubits 

initialized to |0), we can create a uniform superposition of 2 n computational bases, 



/ 1 








\ 





1 
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2 n -l 

i.e., -j= ^ \x). C-NOT takes two qubits as inputs and conditioned on the first 

qubit, it performs a logical NOT operation to the second qubit, e.g., C-NOT|01) = 
1 01 } because the first qubit is 0, and C-NOT|ll) = 1 10) because the first qubit is 
1. It is a unitary operation corresponding to a classical gate. 

Recall that classically if we are given NOT and AND gates, we can construct a 
classical circuit for any boolean function. Such a set of gates is called a universal 
set of gates. Similarly, quantumly, we have universal sets of gates. This means that 
any unitary transformation on n quantum bits maybe approximated to within a 
specified e > 1 (in the spectral norm, say) by composing a sequence of these gates. 
One example involves the use of a C-NOT and a Hadamard with two additional 
one-qubit gates called a phase gate S, and tt/8 gate T. 

s= ( o °i ) ' T= ( A ) 
For the proof of the universality of this gate set, refer Section 4.5 in [NC00]. 

1.2.4 Quantum Algorithms and the Circuit Model 

A quantum algorithm consists of quantum registers that hold qubits, and a series 
of unitary operations described by a quantum circuit. The registers are initialized 
to |0) except for the input register which is initialized to the bits of the problem 
instance, as in a classical circuit. The circuit consists of a sequence of gates from a 
universal set of quantum gates with the labels of the qubits the gates are applied to. 
In Figure 1.2, the registers are represented by black lines. As we apply operators 
we move from the left to the right of the circuit. At the end of the algorithm, i.e., 
at the right end of the circuit, a measurement is performed on one or more qubits 
in the computational basis, which gives an outcome of the algorithm. An algorithm 
is said to compute a boolean function with bounded error if when the input string 
x is in the language, the algorithm accepts x (has outcome 1) with probability 
more than 3/4, and when x is not in the language, the algorithm accepts (i.e., has 
outcome 0) with probability less than 1/4. 

For example, suppose we want to implement an algorithm that flips the phase 
if the registers both contain |0), but not otherwise. Then Figure 1.2 performs such 
algorithm. It first applies an X gate to each register, and then applies a Hadamard 
gate to the second qubit, followed by a C-NOT conditioned on the first qubit, 
followed by a Hadamard on the second qubit, and finally applies X gates to two of 
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x 2/~ X - H 



X 



-e- 



H X - 



Figure 1.2: A Circuit that Implements a Phase Flip 

the qubits. This operator can be written as 

J-2|00)(00|. 

It is straightforward to check that both the circuit and the above matrix flips 
the phase of the qubit if they are both 0. 

The quantum time complexity of a (boolean) function is measured by the least 
number of gates required to implement an algorithm that computes the function 
with bounded error in terms of the size of the input. In Figure 1.2, the input size 
is two and the number of gates is seven. 

1.2.5 Query Model and Quantum Query Complexity 

We first formally define an oracle in terms of an operator. 

Definition 2 (Oracle) [NC00] An oracle O for a function f : {1, . . . , n} — > {0, 1} 

is a unitary operator that acts on a computational basis such that 



0\x)\q) = \x)\q®f(x)), 



where \q) is an oracle qubit with q G {0, 1} 7 which is flipped conditioned on x G 
{1, . . . ,n}, i.e., flipped if f\x) = 1. An oracle for a function with a larger range, 
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{1, . . . , n} is defined similarly, with O(logn) qubits each for the query and the func- 
tion value and © representing a bit-wise XOR. 

Using an oracle, we can perform a query algorithm, 

Definition 3 (T-Query Quantum Algorithm) [BBC+Ola] A T -query quan- 
tum algorithm A with an oracle O for function f is defined as 

A = UtOUt-! . . . OUiOUo, 

where all the transformation are defined on a three register quantum memory con- 
sisting of the query register, the oracle response register and workplace qubits for 
the algorithms. The Ui 's are unitary transformations independent of the function 
f , and the algorithm only depends on the function f through T applications of O. 

The query complexity of an algorithm is measured by the number of oracle 
operators we apply. The query complexity of computing a property P of the oracle 
function / is given by the least query complexity algorithm that computes P(f)- 

For a search algorithm where an oracle outputs f(x) — 1 if x is a target of the 
search and the property being if a given set contains a target element, we usually 
prepare an oracle qubit as ^7=^-, so that we get 




Since the oracle qubit does not change throughout the algorithm, we could 
simply think of this oracle as flipping a phase if f(x) — 1. 

What would be the action of O in a search algorithm? Suppose we have an 
initial state 

2 n — 1 

w = i E w. 

and that \ip) is a combination of two vectors, \a) and \(3), where the former is a 
uniform superposition of elements x such that f(x) = 0, and the latter contains the 
rest of elements. Then the act of applying the oracle is a reflection about the axis 
\a) because 

0(a\a) + b\P)) = a\a)-b\P). 

Recall the phase flip operator from the last section, which up to an overall sign 
of —1 is a reflection operator. Thus we can create the following reflection operator 
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by removing X gates in Figure 1.2. 

2|0)<0|-I. 

This construction extends in a straightforward manner to n qubits. In general, 
in order to implement a phase flip on qubits that represent n, O(logn) gates are 
required. 

We can create another reflection operator also called Grover diffusion operator 
that reflects the state with the axis \ip) by 

# ffin (2|0>(0| - I)H en = 2|^)M - /. 

Hence so far we have two reflection operators, O and 2\ip){ip\ — I. 

Lemma 4 Applying 

G = (2|^|-/)0 

is a rotation in a two-dimensional space spanned by \a) and \j3) by 29, where 9 is 
the initial angle between \ip) and \a) . 

Lemma 4 also holds for the composition of reflections of any two vectors. We 
will use this fact later in this essay. In Figure 1.3, the action of G is described 
geometrically. It first reflects about the axis \a), and then 0\ip) is reflected 
against the original state In this one step of G, there is only one query O. 
In Grover 's algorithm, this process is repeated 0(y/~N) times for N = 2 n so as to 
rotate the state of the query register close to in the worst case when there is 
only one x such that f(x) — 1, 9 « This gives a query complexity of 0(y/n). 

1.2.6 Reducing Error Probability 

In many quantum algorithms, we encounter a problem of reducing the error prob- 
ability from a constant such as 1/4 to polynomial close to 0. An algorithm is said 
to compute a function / with one-sided error given an input x if the following two 
conditions hold, 

1. If x is not in the language, it rejects with probability 1. 

2. if a; is in the language, it accepts with probability at least e > 0. 

This means that we have a probability (1 — e) of having a false negative. In order 
to reduce the error probability to at most 1/2, we repeat this algorithm for k = 
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r~ io g (i- £ ) l ~ e times, because 

i_ 1 

(1 - e) log < -. 
v y - 2 

During any one time in the k repetition of the algorithms, if the algorithm accepts 
x, we terminate and decide "yes". Most of the algorithms in this essay have one- 
sided error. For example, in element distinctness, if we find two different indices % 
and j such that Xi ^ Xj, we are sure it is in fact true. 

In our algorithms, we will often compose bounded error quantum algorithms. In 
such cases, a quantum algorithm is used as a subroutine in place of an oracle. We 
would have to amplify, by repetition, the success probability of the subroutine so 
that the overall algorithm succeeds. This results in an additional factor of O(logT) 
in the query complexity where the complexity with an ideal oracle is 0{T). Such a 
scenario is studied in [HMdW03] as a quantum search with a bounded error oracle. 
The main result in [HMdW03] is that we only need to invoke the oracle ^/n times 
as opposed to the obvious approach that gives ^/n\ogn. 

In this essay, whenever we have a one-sided error and we wish to amplify the 
success probability, we assume the procedure is modified as above. Moreover, if we 
have a case of imperfect oracle realized by a bounded error quantum algorithm, we 
apply the theorem in [HMdW03]. 



Chapter 2 
Related Work 

2.1 Quantum Walk of Szegedy 
2.1.1 Element Distinctness 

Recall from Chapter 1, the problem of Element Distinctness: given a function / : 
[n] — {1, 2, . . . , n} i— > [to], to > n, as an oracle, we want to test if / is one-one or not. 
If / is not one-one, we say there is a collision. That is, collide if f(i) = f(j), 
The function / can also be written as a list of numbers: / = (/i, / 2 , . . . , f n ). The 
goal of the algorithm is to answer this question with as few queries to the oracle as 
possible. 

The significance of this problem is that it is one of the applications of quantum 
walks that gives better bounds than classical counterparts. Underlying this quan- 
tum algorithm is a random walk. Ambainis was the first to adopt this classical walk 
into a quantum algorithm [Amb04b]. Classically, the straightforward algorithm to 
solve Element Distinctness is to go through the list one by one. Interestingly, this 
straightforward algorithm performs better than a random walk based algorithm 
classically which we will see in Section 2.1.2. 

Fact 5 Classical query complexity of element distinctness is 6(n). 

Since it is optimal an unordered search may be reduced to element distinctness. 
However, in quantum scenario, quantum walk based algorithm performs better 
than the above bound. Quantum walk based algorithm is a quantum version of a 
random walk based algorithm, which is described below. 
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2.1.2 Classical Walk Based Algorithm 

The following is a classical algorithm based on random walk for finding a collision. 
This walk is irreducible in the sense that there is a path between any pair of subsets. 

Algorithm 1 A Classical Walk Algorithm for Element Distinctness 

1: Pick a uniformly random set I of r elements out of [n] (call it an r-subset). 

2: Query / at points in /. 

3: if There is a collision within I then 

4: return "Collision found" and elements that collide 

5: end if 

{Walk on r-subsets of [n].} 

{(idea) Pick an element in the set and one not in the set uniformly at random 
(u.a.r.). Swap these elements. Note that we are maintaining the size of the 
subset.} 
6: for t < T do 

7: Pick i £ I and j £ [n] — I u.a.r. 
8: /<-(/- {l}) U {j}. 
9: Query fj. 

10: if There is a collision within / then 
11: Output the elements that collide. 
12: return 
13: end if 
14: end for 

15: print "No collision" {i.e., f is one-one.} 



Let T be the first time the walk "hits" an r-subset containing a collision (hitting 
time). 

Observation 

2 r — 1 1 

Pr (walk stops in two steps I any state) > 1 • • • . 

n — r r n — r 

This is because in the worst case, there are exactly two elements that collide 
with each other, and initially, we do not have any element that form a colliding 
pair in the r-subset. Next we pick one of the two colliding elements from n — r 
elements not in the set with probability In the second step, we first choose an 
element in the r-subset that is not part of the colliding set with probability 
and then we pick the other colliding element not in the r-subset with probability 
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first step second step 

Figure 2.1: The Probability of the Walk Stopping in Two Steps 

-^- r and swap these. In Figure 2.1, 1 and 2 are colliding elements not in the subset 
initially. It describes a sequence of transformation by which they are found by the 
algorithm. The hitting time for the walk is 



E{T)< 



(n — r) 2 r 
2(r-l) ' 



There are more sophisticated arguments that give a better bound on T . We 
will analyze hitting times more precisely. 

2.1.3 Hitting Time in Classical Walks 

Consider a Markov Chain on the state space X, (\X\ — N) given by the transition 
matrix P, where P = (p x ,y) , x, y G X and 

Px,y — -Pr (making a transition to y | current state = x). 

This corresponds to general Markov Chains, in the sense that if we are at x, we 
move to any arbitrary state y in the state space with probability p Xty . P is called 

a stochastic matrix, i.e., ^^p x , y = 1 for all x. So all the rows sum up to 1. 

y 

We assume that the Markov Chain is 

1. Symmetric: p x>y = p VtX . This makes the underlying graph of the walk undi- 
rected. 



2. Irreducible: There is a path between every pair of states. 
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Figure 2.2: An Example of a Periodic Markov Chain 

3. Aperiodic: There exists x G X and t x > 1 such that 

Pr(We reach x in t steps starting from x) > 

for all t >t x . Aperiodicity of the walk is equivalent to having an underlying 
graph that is not bipartite. This implies the same property for all y G X if 
the second property holds. 

What are the properties of such Markov Chains? 

1. Since P is symmetric, it is equal to its transpose, P = P T . So P is doubly 
stochastic; both rows and columns sum up to 1. 

2. Let s be any initial distribution, then s T P t i — > n T = (jj, jf, ■ ■ ■ ■> jj) ast^oo 
in the l\ metric. The distribution 7r is called stationary distribution, which is 
a fixed point in a Markov Chain. So we have uniform stationary distribution. 

3. 7r T P = 7r T = (jj, jj, . . . , jj): Ti is an eigenvector of P with eigenvalue of 1. 
Since P is symmetric, it is Hermitian, therefore it is diagonalizable and all 
the eigenvalues are real. Moreover, the other eigenvalues are strictly less than 
1: Ai = 1 > A 2 > . . . > A n > — 1. The eigenvalue of 1 is obtained from the 
irreducibility property. Aperiodicity implies all the eigenvalues are > — 1. 

In general, a marked state M C X is a subset. For element distinctness, M 
contains two colliding elements. Since we stop at a marked state, the transition 
matrix for this state is different from others. Suppose we would like to search for 
one of the marked states by simulating the walk and stopping when we see a state 
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x G M. The transition matrix now looks like 

Pm ={ P o /')' ^ 

where (P M P') are the rows of P corresponding to X — M , P M is P from which 
rows and columns corresponding to M have been removed. The rows corresponding 
to the states in M are (0 I) since once we reach M, we do not move to any other 
state. 

What is the hitting time of M? Let T be the hitting time for finding a marked 
state starting in distribution s. 

Fact 6 

E(T) = s T M (I-P M )- 1 -l, 

where sm is the projection of s onto X — M, and 1 T = (1, 1, . . . , 1). When M is 
non-empty, and since the Markov Chain is ergodic all the eigenvalues of Pm have 
absolute value less than I. Therefore the expression is well-defined. 

oo 

Proof : For any non-negative integer-valued random variable T, E(T) = Pr(T > 

t=o 

t). In our case, Pr(T > t) is the probability we have not reached the marked state 
after t steps. This is also the probability that we are still in one of the states in 
X — M. Since the state distribution after t steps is s T P t , where 

pt - ( P t F f ) ■ 

Let 1x-m denote a vector that contains 1 for the first \X — M\ entries and 
for the rest. Then we have 

Pr(We are not in a marked state after t steps) = s T P t l X -M = s m-Pm1- 
Then, 

oo 

E(T) =Y,s T m Pm^ 
t=o 



= s 



M 



t=0 



= s T M {I - Pm)- 1 !. 
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□ 

Stationary distribution for P is a uniform distribution over all elements. Thus 
by judicially choosing initial state to be the stationary distribution, we get a good 
bound on hitting time. 

Corollary 7 a. If s — (-^, . . . , -^), then hitting time E(T) is 

E(T) = ±. 1 • (I - Pm)- 1 ■ 1. 
b. Let 1 M = ^'jjlp ■ If the eigenvalues/vectors of Pm are (Aj,i>j) and 1m = 

N-m 

ViVi then, 

i=i 

N-m 1 

e{t) = ^(rnr)' (2 - 2) 
i=i 1 _ Ai 

where N is the normalization factor, m = \M\ is the size of marked subsets, and \ 
is the i-th largest eigenvalue of Pm in magnitude. 

Note in the first part of Corollary 7, the eigenvalues of (I — Pm)' 1 are jzry, for 
each eigenvalue Aj of Pm- Also since we are working with real symmetric matrices, 
all the numbers z/« are real. 

The matrix Pm is real, all the absolute values of eigenvalues of Pm are strictly 
less than 1, and along with the symmetry of Pm, it is orthogonally diagonalizable. 
This means we can choose Vi such that they form an orthonormal set. Note that 
the spectral norm of a matrix is the largest singular value of the matrix. Since Pm 
is symmetric, it is equivalent to the largest eigenvalue. Hence ||Pm|| — Since 

v\ is at most 1, we have, E(T) < ^ = iqfap 

In order to bound the hitting time, then we need to bound the largest eigenvalue 
of P M - 

Lemma 8 ([Sze04b]) If the spectral gap(= 1-X 2 (P)) of P is > 5, and ifj^j > e, 
then || P M || < 1 - | . 

In the above lemma, we define A«(P) to be i-th largest eigenvalue of P in mag- 
nitude. Note that since P has a uniform distribution, Ai = 1. So the spectral 
gap, which is formally, the difference between the largest and the second largest 
eigenvalues in magnitude, is Ai(P) — A 2 (P) = 1 — A 2 (P). 
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Figure 2.3: Pij Moves from One State to Another With a Symmetric Difference of 
Two 

To bound the hitting time of the walk , we would like an explicit formula for 
the spectral gap of P to compute the upper bound of the spectra for P M . Recall 
the state space of the walk is X = {r-subsets of [n]}. Given an r-subset, there 
are r(n — r) other r-subsets to transition to by swapping one of the r elements in 
the current subset with one of the n — r elements not in the subset. Each of these 
r(n — r) subsets have equal probability of being moved to from the current r-subset. 



where J n ,r,r-i is a boolean matrix with entry 1 iff i and j are subsets of size r, 
whose intersection is of size r — 1. 

Theorem 9 ([Knu91]) There arer + 1 eigenspaces of J n ,r,r-i, eigenvalues corre- 
sponding to 



Then 




and 




X j = (r- j)(n - r) - j(r - j + 1), < j < r. 



We have r < |, otherwise we have a high probability of solving the problem in 
Line 3 in Algorithm 1. Also, Xj is a decreasing function of j. The eigenvalues 
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are not all positive, e.g., for j = r, we have A r = — r. However, we are only 
interested in the first and the second largest eigenvalues, which are, Ao = r(n — r) 
and Ai = r(n — r) — n. Since these are eigenvalues for J n , r , r -\ and P = ^z^y, 

the second largest eigenvalue for P is r ^~^ r ~p . From these, we can compute the 
spectral gap: 1 t^J-^ = , n ? > -. Remembering that M is the set of r-subsets 

" ° r r(n—r) r(n—r) r ° 

that contain a colliding pair of elements, in order to lower bound the fraction of 
marked elements, we need to consider the worst case scenario where we have exactly 
one pair of colliding elements. 

\M\ > C2J = r(r-l) > r 2 



\X\ ~ Q) {n-r -2){n-r -1) ~ 2n 2 



for r = o(n) when approximation involved. 
From this, we have 



\Pm\\ <1- 
= 1 - 



r 2 
2» 2 



2 

r 

4n 2 ' 



So 



. , 1 4n 2 ^ /n 2 



This is a bound on the hitting time of the algorithm. The query complexity of 
the algorithm is calculated as follows. We need to make r initial queries for the 
values of each element in the initial r-subset. At each of the 0(^-) iteration of the 
walk, we need to query the value of the new element we swapped into the subset. 
Thus, we have 0(r + — ) query complexity. This is minimized when r — n and 
gives 0(n) query complexity. This is equivalent to checking every element in the 
entire set, thus giving no speedup to the straightforward algorithm of sequentially 
checking every element in the set. 

Now we are interested in the quantization of the classical algorithm we have 
discussed thus far. 



2.1.4 Quantization of the classical walk 

The first quantization of random walk in Algorithm 1 was proposed by Ambai- 
nis [Amb04b], which is described in Section 2.2. A new kind of quantization of 
classical walks was proposed by Szegedy [Sze04b], which we present here in detail. 
The walk is over a bipartite graph. Each side of the graph contains r-subsets as 
vertices. A pair of vertices in the left and the right hand side of the graph are con- 
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Figure 2.4: A Bipartite Walk 

nected only if one can be changed into another on the opposite side by removing 
one of the elements in the subset and adding one that is not in the current set. This 
is equivalent to having two vertices connected if they differ in exactly two elements. 
The probability of moving from a subset x in the left side of the graph to a subset y 
in the right side of the graph is given by p XtV . For each side of the graph, we create 
a state, 

\4>x) = Yl Vp^\ x )\v) 

y 

for the transition from x on the left side to all of its neighbors y on the right side 
of the graph, and 

X 

similarly. Note that, 

a. {<Px}x and {ip y } y are orthonormal sets because each \x) and \y) are distinct. 

b. Let Ei = span{(f> x } x and E 2 = span{%l)y} y and 7Ti, tt 2 to be orthonormal 
projections onto E±, E 2 respectively. We define two unitary operators R\ and R 2 
as i?i = 2^-1 and R 2 = 2TL 2 - I. 

Then R± is unitary because it can be implemented using the combination of 
unitary gates similar to the reflection operator in Section 1.2.4. We can see that 
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Ri is actually a reflection operator about the space E\ because 

R^ip) =(2^-7)1^) 

= (2^|0x)(0x| -I)\<p) 

x 

= Y J ^x)^x\-i)W) 

CP. 

\(p) if (p e Ei 
-\tp) i£<peE£ 

Similarly R 2 is unitary and is a reflection about the space E 2 . 
Definition 10 (Quantitization of a M.C. P) 

W P = R 2 Ri 

Why is this a natural definition? The straightforward way to define a step of 
the walk is 

y _ 
\ x ) Y VP^~y\y) Y VPy^\ z ) 

y z 



\x) 



But this is just a simulation of a classical walk. Instead, we keep the memory 
of the previous step only. To do so, we need an operator that diffuses x into all the 
neighbors y and vice versa. 

Another way to see that the definition of the quantized walk is natural is to 
look at the Grover diffusion operator as an operator to move from a vertex in a 
complete graph to one of the adjacent vertices with equal probability. This idea 
was introduced in [WatOl]. Algorithm 2 describes Szegedy's algorithm. 

Szegedy defines the quantum hitting time as follows. 

Definition 11 ([Sze04a]) T is an c- deviation- on- average time with respect to \<f>) 
if 



^El|wW-M||!>c 



T + _ 

t=o 



This was defined so that after T steps of the walk, the average deviation of 
the initial state is very high. That is, the state is significantly skewed towards the 
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Algorithm 2 Szegedy's Quantization of a Random Walk 

1 - 1 

1: Let |0 O ) = ^J2 Mx) = 77^Y1 VP^\ X )\V) = -f^Yl \<t>v) 

x V x,y y 

2: Measure if the first register is marked or not. 
3: if The first register is marked then 
4: return ' 'Found marked element.'' 
5: end if 

6: Apply H <g) I to |O)|0 O i) to get ^(|O)|0 O i) + | Woi» 
7: Pick t u.a.r. from [0, . . . , T] 
8: for i < T do 

9: Apply controlled- Wp conditioned on the first register. 

{The modified matrix P in Equation 2.1 lets us remain in the same state 
once we arrive at the marked state.} 
10: end for 

{ Now we have ^(|O)|0 O i) + |l)^|0oi))} 
11: Apply H <g) I to get ±|O)(|0 O i> + W%\4> Q1 )) + ||l)(|0oi) - Wj|<A)i» 
12: Measure the first register. 
13: if There is a T' in the first register then 
14: return ' 'Detected marked element.'' 
15: else 

16: return ''No marked element.'' 
17: end if 
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marked state and so the probability of observing the marked state is high. Since the 
walk is realized by unitary evolution it cannot end up in a marked state. Instead, 
it can cycle through states with high amplitude on marked states. 

Next we compute the complexity of this algorithm. One step of the walk is R\ 
followed by i?2- We show here that R\ can be implemented efficiently i?2 can be 
implemented similarly. Recall that 

R l =(211! -J) 

= ^2^1^(0.1 -?j 

= ^(2|0,)(0,|-/,), 

X 

where I x is identity on \x) (g> C x . The last line follows from the fact that we are 
working on 

C XxX ^ C X C X 

= ® x \x)®<C x , 

so Ri which acts on C x (g>C x can be decomposed into the direct sum of \X\ diffusion 
operators, 2\(/> x ) {(/> x \ — I x . Since this is the reflection in \x) <8> C x about 10,), this 
can be written as 

IxXarl <8» (l/ x (2|l5><tJ| - 7)C^) , 

and 2|0)(0| — / can be implemented using 0(log|X|) gates similarly to the con- 
struction of a circuit for 2|00)(00| — I in Section 1.2.4. 

From the above argument, we see that if there is an efficient procedure to 
implement the transformation 

(i®u x )\x)\o) = \x)J2Vp^\v)> 

y 

then the algorithm can be implemented efficiently. 
2.1.5 Hitting Time in Quantum Walks 

In order to analyze the deviation time, it suffices to take a look at the eigenvalues 
and eigenvectors of one step of the walk. This is because for a unitary operator 
U = ^~]e %ej \vj)(vj\ with {\vj)} being the orthonormal eigenvectors of U, U l = 

j 
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e l6jt \vj) (vj\ so it has the same set of eigenvectors. 

3 

Recall that W P = R 2 R\ = (2TL 2 — /) (2IIi — I), where IT is an orthogonal 
projection to Ei, and E\ is the space spanned by \<p x ) and similarly for E 2 . Let 
A = S ^\4>x){x\ and B = \(f) y )(y\. Note that the dimension of the space in 

x y 

which \(f) x ) lies is N 2 and that of (x\ is N. Then we can write W P = R 2 Ri as 
(2AA* - I)(2BB* - I) because III = ^1^X0*1 = AA \ and similarly for U 2 . 

X 

Note that A and 5 are norm-preserving operations because A^A = I = B^B and 
A maps a vector in C x into its subspace E 1 and similarly for fi. 

Suppose a vector t> e (Ei + E 2 )~ L . Then f lies in the space orthogonal to both 
E\ and E 2 . Since Ri reflects a vector orthogonal to E^, then f is reflected by both 
Ri and R 2 . Then applying a walk operator Wp = R 2 R\ does not change v. Hence 
Wplf) = |i>), and the subspace spanned by such t>'s is an invariant subspace, an 
eigenspace with eigenvalue 1. Thus we only need to analyze the behavior of W P in 
Ei + E 2 . Suppose we have \w), \v) G C x , then we want to analyze the action of R 2 
on A\w) and the action of Ri on B\v). (Since A\w) E E 1 , the action of R\ on A\w) 
is identity.) Since 

R 2 A\w) = 2U 2 A\w) - A\w) 

= 2B(B^A)\w) - A\w), 

where the first term of the last line lies in E 2 and the second term in Ei, and 
similarly, 

R 1 B\v) =2A(A*B)\v)-B\v), 
we define the discriminant of A and B as follows. 

Definition 12 The discriminant matrix D of A and B is D = A^B. 

Theorem 2.1.1 If D = dj\ w j){ v j\ ^ s the singular value decomposition of D, 

j 

then 

a) 0< 8j < 1. 

b) The space generated by {Awj, Bvj}, for all j where (wj, Vj) is a pair of singu- 
lar vectors is invariant under Wp. And Wp restricted to this space is a composition 
of a reflection about Awj followed by a reflection about Bvj . 

Let the angle between Awj and Bvj be 0j, that is the singular value correspond- 
ing to Awj and Bvj be {wj\A^B\vf) = (wj\D\vj) = cosOj, for 9j G [0, n/2]. Recall 
from Lemma 4 that a product of two reflections about two reflectors \<f>) and 
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is a rotation by an angle 29, where 9 is an angle between the vectors \<p) and \ip). 
Similarly then Wp is a rotation by 29 j in this subspace. 

Proof : [Theorem 2.1.1-a] Singular values are taken to be real and non-negative 
by convention, so 5j > 0. Since A, B are norm-preserving \\Aw\\ = \\w\\ and 
\\Bv\\ = \\v\\. Using these facts, 

8j <max {la)m \{(3\D\a)\ 
= max { | Q)) |^ >} \((3\A*B\a)\ 
<mzx {la)m \\A\/3) || -\\B\a) \\ 
< 1. 

Hence Sj < 1. □ 

Proof : [Theorem 2.1.1-b] As we mentioned before, we only need to consider the 
action of Wp on Awj and Bvj and that Awj is invariant by Ri and Bvj is invariant 
I'.v Hi. 

W P \A Wj ) = (2n 2 - I)A\ Wj ) 

= (2(BB^) - I)A\wj) 
= 2B(BtA\wj))-A\wj) 
= 2BD^\wj) -A\wj) 
= 26jB\ Vj ) - A\ Wj ) 

= (cos9 j B\v j )) - (A\wj) - cos9 j B\v j )). 

The first term in the last line is the component of A\wj) that is along B\vj) 
and the second term is the component of A\vjj) that is orthogonal to B\vj). So, on 
A\vjj), R\ is a reflection about A\wj) (identity in this case), and R 2 is a reflection 
about B\vj) (because of the orthogonal component) in the subspace. Similarly, on 
B\vj), Ri is a reflection about A\wj) (because of the orthogonal component), and 
R 2 is a reflection about B\vj) (identity) in the subspace. □ 

Using Theorem 2.1.1, we are ready to estimate the deviation time with the 
initial state, 

xex-M y ex 

= Vn E l^>> 

xex-M 

because |0 O i) is the state that remains after the measurement in Step 3 of Algo- 
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rithm 2. We would like to bound T such that 



- \M\\ 2 > c(l - e) 

t=o 



for some small positive constant c and e being the fraction of marked elements. This 
is because by Szegedy's definition in Definition 11, the hitting time is the time it 
takes for the state to be significantly different from the initial state, greatly skewed 
towards the marked state. That is, we want the L 2 norm of the difference between 
the final and the initial state to be at least as large as the fraction of unmarked 
elements. This ensures that when we measure the final state, we detect a large 
deviation from the initial state. The term in the summation is 2(1— e— (0oi| Wp|0oi)) 

because |0 O i) = —fj^\4>x) and so |||0oi)|| 2 = 1 — e, so we need to upper bound 



xdX-M 



T+l 



W|W£l0oi) 



t=0 

Now 

D(x,y) = (x\D\y) 



(x\AB\y) 



= (x 




then for Dp, the (x, y) entry is ^p x , y ^p y ,x- Since if exactly one of x or y is marked, 
\[Vx~y or \f¥y~x is 0, the entry of Dp is zero if exactly one of x or y is marked. Also 
since Pm, and / are symmetric, for (x, y) both being unmarked or marked, we have 
Pm or / respectively as diagonal blocks in Dp. So 



Dp 



Pm 
/ 



Now we are ready to use Theorem 2.1.1. Let the normalized eigenvectors of 
Pm be {v' k }k with eigenvalues and denote Vk for v' k padded with to make 
an eigenvector of Dp. Since Pm is symmetric, all the eigenvectors are orthogonal. 
The rest of the eigenvectors of Dp are {la:)}^ for x G M. These vectors are also 
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orthogonal to each other, and all n eigenvectors of Dp are orthogonal to each other 
as well. So its eigenvectors are the singular vectors and the absolute value of the 
eigenvalues give the singular values because some eigenvalues may be negative. 
Then from Theorem 2.1.1, the invariant subspaces of Wp are the subspaces F k 
spanned by the pairs (Avk, Bvk) with singular value |Afc| for all k and the subspaces 
F x spanned by the pairs (A\x), B\x)) with singular values 1 for all x G M. Since 
the product of two reflections is a rotation as we have seen before, the action of 
Wp is a rotation of the subspace F k by 29 k , where 9 k is the angle between Av k and 
Bv k : 

(v k \A j B\v k ) = cos9 k , 

This is also equal to the singular value of A^B = D corresponding to v k , 9 k = 
cos -1 |Afc|. Also 9 k G (0, 7r/2] cannot be zero because cos9 k = \X k \ < 1 and from 
Theorem 8, \\Pm\\ = Ai = 1 — 4f < 1 assuming that e ^ 0. So Wp rotates the 
subspaces by 29 k t. 
Observation 

a) 

\4>oi) e span{Av k } k 

because 

x&X-M 

xeX-M \ z / 

= Vn E 4*> 

xex-M 

and \x) G span k {v k } for n G X — M. So |0 O1 ) is spanned by Av k for all k. 
b) 

|||0oi>|| 2 = l-e 

because 

IH0oi)|| 2 =i E (^i^> 

_ \X-M\ 
~ N 

= 1 -e. 

So we normalize the initial state and also write this as the linear combination 
of the spanning set, 

-^t = 5>AK>. (2.3) 



CHAPTER 2. RELATED WORK 



29 



Note that the square of the amplitudes sum up to 1 so v\ — 1. Let \z k ) = 

k 

A\v k ). Then \z k ) are orthonormal to each other because \v k ) are orthonormal to 
each other. A preserves inner products: 

(zkM =A^A{v k \v k ,) 
= (v k \A*A\vk>) 
= (vk\v k >) 
= 0. 

v 2 

Claim 13 IfT> 100 V 



rTT E II^pI^oi) - I0oi) II 2 = 2 ( (1 - e) - 5^ E^oil^oi) 



t=0 \ t=0 

>c(l-e), 

/or some constant 2 > c > 0. 
This means 

^tE^iI^I0oi)< (l-0 (1-6). 

t=0 

Proof : From Equation 2.3 

(001 1 1 001 ) = !=i E ^ ^ ^ fel I ^ I Zfc 2 ) 

because |^)'s are orthonormal to each other and belong to orthogonal eigenspaces of 
Wp. The last line is obtained from the fact that the angle between \z k ) and W p \z k ) 
is 29 k t since one step of Wp rotates the subspace by 29 k , and so {z k \W p \z k ) = 
cos(29 k t). 
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Now we use three mathematical identities to bound ^cos (29 k t). First, 



t=o 



cos (W) 



e iut + £ -iut 



So the sum of cosines is a sum of two geometric series. Using the formula for 
the sum of geometric series we have, 

i ^ u lw t u N <h 1 cos (29 k T) - cos (29 k (T +!)) + !- cos (29 k ) 
-^(falWpfa) <(l- c )^„ fc _ 2(1 -cos (20*)) • 



T+l 

t=0 



(20*)) 
Next, we use 

| cosa — cos/3\ <\a — f3\ 
to bound the numerator and use 

a 2 

cos a < 1 — — for a £ [-3.79,3.79], 
8 

to bound the denominator. Note that here, a = 29 k and 9 k e (0, |] as mentioned 
before, so the third inequality can be applied. 
Using these, we get 

T 

^lE^il^l0oi) <(1-^E^ 2 26k + 26k 



, 2((2 W /8 

2 



< (1-^)4- 



^ 2 



The last line comes from the fact that we have chosen T > 100 So the 

claim holds for (l — | ) = ^ or c = ||. □ 

We can relate the hitting time of the walk with the eigenvalues of Pm- 

Corollary 14 c-deviation on average time for Wp with respect to Hf= is O 



^1-\\P M \\ 
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vl 



Proof : We know that T > 100 V and that cos# fc = |A fc |. Then 

9k > sin 9k = a/1 — cos 2 9k 



because X k < 1. 
So, 



x 1/2 10 Q 

t < ioo 2^ fc = < -^=^= ^ 

k V 1 _ A fe V 1 - ll P ^ll fe 



_ ?/r . 100 

fc ' 

because v\ — \ and any eigenvalue in P M is at most the largest eigenvalue of 

k 

P M which is IIPmII- This means that the hitting time TeO , 1 I . □ 

" " 6 \^1-\\Pm\\ ) 

Recall that the classical hitting time for a symmetric transition matrix is O {j^p^j > 
so using Szegedy's walk we have quadratic speedup. 

Theorem 15 ([Sze04a]) For the quantum walk based on a transition matrix P 
with eigenvalue gap of 5, the fraction of marked elements |M|/|X| at least e, in 

time O (l/y/5ej, Algorithm 2 detects a marked element with probability at least 
if it exists, in 0(1 /V6e) application ofWp. 

Proof : If a marked element exists, either a marked element is detected in 
Step 2 with probability e, or a deviation is detected in Step 11 with probability 

T 

— 4Cr+ii 1 1 TVj, | <^>oi ) — 1 0oi ) II 2 > — (1-e). Then the net probability of success is 
t=o 

e + iK 1 ~~ e ) - M + W- Here > T = °( ~^Ti \ P , || ) from Corrollar y 14 and Lemma 8. 
□ 

The consequence of Theorem 15 is that it suffices to analyze the classical version 
of the walk in order to bound the quantum hitting time. Suppose we have three 
different costs, time or query, associated with a classical walk based algorithm. A 
setup cost, s(r), an update cost u(r) and a checking cost c(r). A setup cost is the 
cost required to set up the initial r-subset, an update cost is the cost to maintain 
the data pertaining to the r-subset during the walk, and a checking cost is the cost 
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needed to see if we have a marked subset. Then the total quantum complexity of 
this algorithm is 

s(r) + — (u(r) + c(r)) . (2.4) 

Throughout the rest of the essay, we will describe the classical versions of the 
algorithms to obtain quantum upper bound. 

As an application to element distinctness, using Theorem 15, we get 0(n 2//3 ) 
bound. If we use the classical walk, however, we get a query complexity of 0(n 4 / 3 ), 
which is worse than the straightforward algorithm that gives 0{n). Because in 
quantum case, we have a smaller hitting time, a walk based approach performs 
better. 



2.2 Quantum Walk of Ambainis 

There is another quantum walk algorithm proposed by Ambainis [Amb04b] to solve 
Element Distinctness, which came prior to [Sze04b, Sze04a]. This is generalized 
in [MSS05, CE03] to solve any A;-collision problem and is called Generic Algorithm. 

Definition 16 (A>collision) [CE03J Given a function f on a set S as an oracle 
and a k-ary relation C C S k , find a k-tuple of distinct elements (ai, 02, ... , at) G S k 
such that (/(ai), /(a2), . . . , f(dk)) £ C if it exists. Otherwise, reject. 

In the circuit for the generic algorithm, there are three main registers, a set 
register, a data register and a coin register. The set register holds a subset / of 
the set S, of size r or r + 1. The data register holds the data corresponding to 
the set in the set register. The coin register holds an element of S — I. In element 
distinctness, for example, the set register contains indices of elements % in r-subset, 
the data register contains the actual value Xi for each element in the set register, 
and the coin register contains the indices j's that are not in the set register. 

The walk starts with a uniform superposition of r-subset in the set register and 
sets up the corresponding data register as in Szegedy-walk. Unlike Szegedy-walk, 
this algorithm also sets up a coin register C . At each step of the walk, if the subset 
is marked, i.e., contains a fc-tuple in C, then it flips the phase by applying a phase 
flip operator similar to the one in Section 1.2.4. Then it enters quantum walks to 
flip the coin. It diffuses the coin register over indices in S* — J by applying a Grover 
diffusion operator similar to the one in Section 1.2.5 and adds the element from 
the coin register to the set register. Now the size of the set register is augmented 
to r + 1. Then it diffuses the set register over J, and removes one element from 
the set register. Note that during this diffusion step, the data register is updated 
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correspondingly. This process is repeated for some time before checking the subset 
for a marked state. When the size of r-subset is 1, this is analogous to what Grover's 
algorithm does. 

Similarly to Equation 2.4, we can write the expression for the total cost of 
Ambainis-walk using a setup cost, an update cost and a checking cost from the 
classical walk, 



One of the differences between Ambainis-Walk and Szegedy-Walk is that in the 
former, checking takes after \fr steps of the quantum walk, whereas in the latter, 
checking takes after every step of the walk. Also, in the former, the walk is over a 
graph, in which the vertices are a subset of size r or r + 1 and they are connected iff 
the size of the vertex differ by 1 and the symmetric difference is two, whereas in the 
latter, the walk is over a bipartite graph, and each side of the vertices are subsets of 
size r, and they are connected iff the symmetric difference is two. We shall see later 
how these differences affect the performance of an algorithm for different problems. 



Suppose we are given an oracle for the adjacency matrix of a graph. It takes two 
vertices in a graph as inputs and outputs 1 if the vertices are connected by 
an edge and otherwise. We are promised that there is exactly one clique of size 
three, called triangle, or none at all. Our goal is to test which case holds for an 
undirected graph Q with as few queries to the oracle as possible. 

For Q with n vertices, classical lower bound is f2(n 4//3 log 1 / 3 n) [CK01]. Quan- 
tumly, the lower bound is Vl(n) [MSS05], by the following argument. Suppose there 
is a graph Qi, that is formed by adding an extra edge to one pair of the n leaves 
in a star graph, Q 2 . Then there are n 2 possible triangles in Q\. We are given an 
oracle for the edges in Q\\ it answers "yes" in input if it is part of the graph. 
The goal is to find an edge in Q x that is part of Q\ — Q 2 . Using a lower bound 
for unordered search over n 2 edges this takes Q(n) quantumly as we prove later 
in Section 2.4.3. Now such an edge forms a triangle in Q\. So if we are given an 
algorithm for triangle finding, we could also find an edge in Q\ — Q 2 - Hence the 
quantum lower bound for triangle finding problem is Q(n). 

A straightforward quantum upper bound is 0(n 15 ) by running Grover search 
on n 3 triplets of vertices, querying three times at each iteration. Here we present 
an algorithm of Magniez, Santha, and Szegedy [MSS05] that uses Ambainis-based 
quantum walk and queries the oracle 0(n L3 ) times. We also present an algorithm 




2.3 Triangle Finding Problem 
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that uses Szegedy-based quantum walk to compare its performance with Ambainis- 
based quantum walk algorithm. We will also point out why there is a difference in 
performance between the algorithms that use these two different quantum walks. 

2.3.1 0(n 1 3 ) Algorithm Using Ambainis Walk 

Recall from Section 2.2 that the query complexity for solving /c-Collision for a set 
of n elements by performing a quantum walk on r-subsets is, 



The approach in [MSS05] consists of an outer algorithm A Q and a subroutine 
A s . The input of A Q is a set V of n vertices. The output of A Q is a pair of vertices 
in V that is part of a triangle if there is one, "reject" otherwise. The input for A s 
is a set of r vertices, V s and their adjacency matrix as well as a vertex v that is not 
necessarily in V s . The output of A s is an edge called Golden Edge in the adjacency 
matrix for vertices in V s that together with v forms a triangle. Then in order to 
find a triangle edge in the subset in A Q , we only need to feed each of the n vertices 
in V and an adjacency matrix for a subgraph induced by an r- subset into A s . A 
further modification is that using Grover's search algorithm, we search for a vertex 
that forms a golden edge by repeating this algorithm sfn times instead of n. 

Next, we analyze the query cost of A s and then A a . Remember that in A s , we 
are given the adjacency matrix of a set of vertices V s of size r. We perform a walk 
on s-subsets of [n] to find a golden edge in V s . We create a subset of size s out 
of r vertices, and query if each of s vertices is connected to the given vertex v, 
because v might come from outside this set V s . This setup cost is then 0(s). At 
each step of the walk, we get a new vertex from V s into the subset of size s, but 
in order to check if there is a golden edge in the subset, we only need to query if 
the new vertex is connected to v. So the update cost is 1 and the checking cost is 
0. The parameter k for this instance of /c-collision is 2, because we are looking for 
two vertices that form a triangle with v, giving the total query cost of the order of 



The outer algorithm A D performs a walk on r-subsets of vertices of V Q . The 
data are the adjacency matrix of the subgraphs induced by the r-subset. Initially 
we need to query r 2 times to set up an adjacency matrix of the subset. At each 
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step of the walk, we insert a new vertex in the subset and remove one from it. 
We update the adjacency matrix for the new vertex, which costs r queries. For 
detecting a golden edge, we invoke \fn times the subroutine A s that costs r 2 / 3 . 
Hence the checking cost is y/nr 2 / 3 . The parameter k = 2 because we are looking 
for two vertices that are part of a triangle. Hence the total cost is, 




This is minimized when r = n 3 / 5 giving 0(n L3 ) query complexity. 



2.3.2 Szegedy Walk Does Not Perform Better 

Does using Szegedy- Walk give any advantage in query complexity for this problem? 
Suppose the goal of the outer algorithm A Q and the subroutine A s are the same as 
in [MSS05]. Then for A s , the setup cost, update cost and the checking cost do not 
change. Using s as the size of the subset and r as the number of vertices in A s , 5 is 
1 / s from Theorem 9, and e is the probability that we have two vertices that form 



-2> 



a golden edge with v, so e = Ks ^ J ~ ^, for s e o(r). Then the total cost is, 

minimizing this gives r 2//3 when s = r 2 / 3 , which is exactly the same as in [MSS05] 
described in Section 2.3.1. 

For A Q , the setup cost, update cost and the checking cost are as same as 
in [MSS05]. Using r as the size of the subset, 5 is 1/r from Theorem 9, and e 
is the probability that we have two vertices that form a golden edge in the r-subset 

V s . So e = Kr ^ J ~ ^j, for r G o(n). Then the total cost is of the order of, 

r 2 + -^=(r + y/nr 2/3 ). 

However, this gives 0(n L5 ) query complexity for r = 0(1), the same as the 
straightforward application of Grover's search and worse than in [MSS05]. 

It turns out for the same setup, update and checking cost, we can easily see 
which algorithm will perform better [Mag05]. Compare the formula for A;-collision 
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using Ambainis-Walk 

<r) + {^) k/2 (c(r) + ^Fu(r)) 
= s(r) + ^c(r) + ^u(r), 



with the one for Szegedy-Walk, where 5 — - and e = 




s(r) + ^^(c(r) + u(r)) ^ 

= s(r) + J^c{r) + -g^ u ( r ). 

From these we see that Ambainis-Walk always performs better because the 
second term is always better than Szegedy-Walk, while other terms are the same. 
This allows us to have a higher query cost for checking, giving an improvement over 
the straightforward 0(n 1,3 ) upper bound for triangle finding algorithm. 

There are other algorithms that use Szegedy-Walk, such as an algorithm that 
performs a walk based on edges. However, so far all these algorithms give the same 
0(n 15 ) query upper bound. 

2.4 Adversary Method for Query Lower Bounds 

In this section, we describe one of the popular methods to derive lower bounds for 
quantum query complexity. Later in this essay we apply this technique to derive 
lower bounds for the problems we are studying. 

2.4.1 Quantum Adversary Theorem 

Suppose an oracle takes an input i and produces Xi to form a string x = (xi, X2, ■ ■ ■ , Xn) 
{0, 1}. Furthermore, suppose there is a boolean function that takes the string x 
as an input and produces an output f(x). For example, in unordered search, the 
oracle takes an index % and outputs Xj. The boolean function f(x) is the logical 
OR of all xf. f(x) = VjXj. We want to lower bound the number of queries needed 
to decide f(x). 

Theorem 2.4.1 [AmbOOj Let A C {0, 1}^ be a set such that every string in the 
set maps to under f , and let B C {0, 1}^ be a set such that every string in the 
set maps to 1. Suppose that 

1. For all x G A, there exists m different y G B such that yi ^ Xi for exactly one 
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2. For all y G B , there exists m! different x G A such that x-i ^ yi for exactly 
one i. 

Then Cl(y/mm') queries are required to compute f . 

Proof : Suppose we have a t query bounded error algorithm for computing /. In 
order to lower bound t, the number of queries needed, we take a look at W t , the 
sum of all the inner products at the end of t-th query over all pairs of x and y that 
satisfy the relationships stated in parts 1 and 2 of the theorem: 

w t = MM)- ( 2 - 5 ) 

(x,y)£R 

The proof estimates the difference \Wt — Wo\ and \Wj — Wj-i\, the difference made 
after each query to the oracle in terms of \R\. 

Let \ip 3 x ) be the state of the algorithm after the j-th query if queries were an- 
swered according to the input x = (xi,x 2 , ■ ■ ■ ,xn). We are interested in (V^l^)) 
i.e., how much the states will differ after j queries if x is taken from the set A and 
y is taken from the set B. For this inner product, there are two simple things we 
know about, 

Property 1 <^J> = I. This is because |^> = \4>° y ) = ^ start)- 

Property 2 At the end of the algorithm, the inner product must be small: After 

t queries, KV'xlV'y)! — c f° r a constant c < 1. 
Proof : The proof of Property 2 above follows from the lemma, 

Lemma 17 ([AKN98]) If \{ipi\ip2)\ > 1 — e , then for any measurement M and 
any outcome i, the probability of finding i when measuring \ipi) and \ifj 2 ) differs by 
at most \plk. 

Suppose there is an algorithm with the probability of obtaining correct outcome 
greater than or equal to |. The probability of having an outcome is at least | 
if we have an input x such that f(x) = 0. The probability of obtaining is less 
than | if we have y such that f(y) = 1. This means that if we have x G A and 
y G B and measure the final state |^*) and \ipy) then the probability of measuring 
differs by at least 1/2. So 




Therefore, the inner product KV'il^)! differs by at most | < 1. □ 



CHAPTER 2. RELATED WORK 



38 



Property 3 From Property 1, we know W = (V'olV'o) = 1 = l-^l> 

(x, y )eR (x,y)eR 

where \R\ > \A\m, \B\m! '. 

Property 4 From Property 2, we know that after the last, t-th query, each of the 
inner product is at most | in absolute value, so \W t \ < ||-R|. 

Lemma 18 \W j - W,_i| < -t=\R\. 

' J J ' v mm' 1 1 

We will provide a proof of Lemma 18 shortly. From Property 3 and Property 
4, we get \W t — W \ > ||-R|, that is queries performed during the entire algorithm 
decreases the inner products in Equation 2.5 at least one eighth the size of R. 
Since at each step of the query, quantity Wj decreases by at most ^==|i?| from 
Lemma 18, the total number of queries must be at least 

\W t - Wo I Vmm' 
> J — L ^ > 1 . 

" -y^ R ~ 16 

Vmm' 1 1 

This proves the query lower bound of Q(vW). □ 

We are now left with the proof of Lemma 18. 
Proof : Let 

n 

i^r 1 ) = ^uxAi)\(f>x,i), 
i=i 

where is the state of the algorithm before j-th query on input x. After j-th 

query, we get 

n 

i^> = E a *.*i < >i^,i>' 
i=i 

where \4>' x ^) is obtained from applying a query operator Q to \i)\(p x ,i)- Now suppose 
we have two input strings x = (xi, X2, ■ ■ ■ , xn) and y = (yi, y 2 , . . . , yjv), where we 
have exactly one % such that Xi ^ yi. For such i, we can rewrite as the part 

that involves such i and the rest, 

l^" 1 ) = uxAi)\<l>x,i} + W x ) 

and similarly for \ipi y ~ l ), 

Wy- l )=ay,V)\<Py,) + W y )- 
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The inner product (ip J x l \ipi y l ) can also be decomposed into two parts, the one 
that involves the i and the rest, 

(^rvr 1 ) = <^<</wi<M + w v w x ). (2.6) 

Similarly, we can rewrite \ip x ) and as 

Wx) = a x ,i\i)Q Xi \(f) x ,i) +QW X ) 

and 

W>=«y,i|i>0 W IW+QW- 
The inner product of the final state is, 

WM) = aZ&sC&QxAMM + W*)- (2-7) 

Note that the query in Equation 2.7 does not change the second term because 
we apply the same unitary transformation Q to the second registers for both \i/j x ) 
and \ipy)- They contain the same data, and the unitary transformation preserves 
inner products. So we only need to be careful about how much {4> y ,i\(i> x ,i) changes. 
Since the inner product of \(f> y> i) and \tp x ,i) is & t most 1 and Icc^a^l < la^Ha^l, 

i(^i^)-(^r i i^r i )i<2i^ii^i. 

However, we are interested in the difference above for all (x, y) G R, so 

\ w j- w j-i\ < 2 J] K.ilK,*! 

(x,y)eR 

< Yl (^\ a ^\ 2 + 7~ 1 K,;| 2 ) • 

(x,y)eR 

For going from the second to the third line above, we used an inequality 2AB < 
A 2 + B 2 with A = y/j\a X)i \ and B = v^K^I- 



CHAPTER 2. RELATED WORK 



40 



Now we bound 7|ax,i| 2 an d 7 1 | Q; ?/,i| 2 separately. 

(x,y)eR (x,y)eR 

(x,y)eR x£Ay:(x,y)&R 

= tI^I 

7' 



^ \R\ 



Above, we used the fact that given x, we have at most N different y's that differ 
by exactly one position: 

N 

l a x,i| = \ a x,i\ 1 

y:{x,y)£R i=l y:(x,y)eR,Xi^yi 

= f. 

The last line comes from the fact that the squares of amplitudes sum up to f . Also, 
since for every x G A, we have at least m different y G B that differ by 1, so 
I-RI > m\A\ and hence U| < ^. 

11 — 11 11 — m 

f l-Rl 

Similarly y r )~ l \a vi \ 2 < and we get 

7 m' 

(x,y)eR 

{x,y)dR 

< i\r\ + z±\r\ 



R\. 



The above expression is minimized when 7 = yP^j to give 



\Wj - W^_i] < \R\. 

\lmm! 



□ 



The Quantum Adversary Theorem we have proven is of the simplest form in 
that the yes and no instances only differ in exactly one position. The stronger form 
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of the previous theorem relaxes the number of % at which x and y differ to be more 
than one. This gives a tighter bound for several problems of interest. 

Theorem 2.4.2 [AmbOOj For a boolean function f : {0, l} n -> {0, 1}, let A C 
/ _1 (0), B C /-^l) andRCAx B. 

1. For all x E A, \{y : (x, y) G R}\ > to. 

2. For aZZy G S, |{x : G > to'. 

3. Define Z X)< = |{y : G i2, 7^ U Z tf)i = |{x : (x,y) e R, Xi ^ y,i}\ and 



T/ien f2 (y/ H*pi j queries are required. 



Unfortunately, it is proven by Szegedy [Sze03] and independently by Zhang [Zha03] 
that this method cannot provide a tight lower bound for all the problems. Infor- 
mally, a 1-certificate is the least number of bits of the input that determines the 
value of the function to be 1. If the size of a 1-certificate is Ci(/), and iV is the 
number of variables in the boolean function to the oracle, then the method can only 
prove up to the lower bound of 0{^/C~Jj)N) [Zha03]. For example, in element dis- 
tinctness, Ci(f) = 2, because we need to know the two elements that collide. Then 
this quantity is 0(\/n), but the tight lower bound of this problem is 6(n 2 / 3 ) using 
polynomial method [AS04]. 

The polynomial method [BBC + 01b] is another powerful lower bound technique. 
However, this method is also proven not to be tight by Ambainis [Amb03]. As 
far as we know neither the adversary nor the polynomial method provides a tight 
lower bound for all problems of interest. For some problem, the adversary method 
provides a better bound than polynomial method [Amb03] and the opposite also 
holds [AS04]. 



2.4.2 The Graph Connectivity 

As an application of Theorem 2.4.2, we take a look at the Graph Connectivity 
problem [DML03]. An undirected graph Q is described by (™) variables {Gij}, 
where Gij = 1 if is an edge in Q and otherwise. The oracle gives the entries 
of adjacency matrix Gij. We want to find if Q is connected by making as few 
queries to Gij as possible. What would be the lower bound for quantum query 
complexity? 

Let A be the set of graphs on n vertices that consist of two cycles not connected 
one to another, each cycle of length at least n/3. Let B be the set of graphs that 
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Figure 2.5: Transformations between G and G' 



are one cycle of length n. In both cases each vertex belongs to one of the cycles. 
We define the relationship as R = {(G,G') : G' has exactly two edges not in G}. 
We can obtain G' from G by deleting one edge from each cycle in G and inserting 
two edges to make it a single cycle. When connecting cycles, there are two ways, 
cross or parallel. So given G, the number of possible G' you can make is 

\{G' : (G, G') e R}\ = (length of first cycle) (length of second cycle) x 2 

\ onn 

- 3 3 
2n 2 

9 

because each cycle in G is of length at least n/3. Hence m = f2(n 2 ). 

Creating G from G' starts by picking one edge out of n edges in the cycle. Since 
each cycle in G is of length at least n/3, the next one must be at distance at least 
n/3 from the edge we just picked. This leads to n — 2 (n/3) = n/3 choices for the 
second edge. After that there is only one way to connect the vertices to create the 
two cycles. Hence wl = f2(n 2 ). 

For each instance in G, the number of instances in G' that differs at position 
i-e., lc,(i,j) 1S 0{n) or 0(1). If is an edge in G, and is not an edge 
in G', then there are < 2n/3 graphs G n s we can make by removing and one of 
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at most 2n/3 edges in G, and we have lc,(i,j) = 0(n). If is not an edge in G, 
but an edge in G', then there are four ways to create G' from G, e.g., by connecting 
and connecting a vertex to the left of i with the one left of j or connecting 
a vertex right of i to the one to the right of j. Similarly, lc,(i,j) = 1 if is an 
edge in G, and at most n otherwise. Overall then we have I < 0(n), and the query 



2.4.3 Lower Bound for Unstructured Search 

In this section, using Theorem 2.4.1, we prove a lower bound for a search on un- 
structured database [Amb03]. Unordered search is defined as follows. Given an 
oracle for x = (x±,X2, ■ ■ ■ ,x n ) G {0, 1}™, is there i such that X{ = 1? This lower 
bound is useful in later sections when we reduce from this search problem to the 
problems of our interest. This lower bound was first proven in [BBBV97] using a 
"hybrid argument". 

Theorem 2.4.3 [AmbOS] The query complexity of a search on unstructured database 
of size n is Q(y/n). 

Proof : Suppose we have n boolean elements, (x±,X2, ■ ■ ■ ,x n ). Let A be the set 
that contains exactly one Xi — 1 for some % G [n]. Let B be the set such that 
Xj = for all j. Then for every a G A, there are m = 1 elements in B that differ 
by exactly one position. For every b G B, there are mf — n different elements in 
A that differ by exactly one position. Using Theorem 2.4.1, the number of queries 
needed to search an element in unstructured database is Q(y/n). □ 



2.5 Quantum Matrix Verification Problem 

Suppose we want to verify if AB = C for n x n matrices A, B, and C over some ring. 
The oracle knows the entries of A, B, and C. What is the query and time complexity 
for this problem? Classically, there is an 0(n 2 ) time algorithm by Freidvals using 
random vectors [Fre79]. Classical query lower bound for this problem is Q(n 2 ), by a 
reduction from unordered search; Let A and B be matrices having all entries being 
1: Let C be a matrix with all entries being n. Then AB = C. If we set one of the 
3n 2 entries to be then AB = C no longer holds. Hence we are searching for one 
entry of out of 3n 2 entries. The classical lower bound for unordered search for n 2 
elements is Q(n 2 ), hence we have an Q(n 2 ) lower bound for matrix verification. 
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2.5.1 Upper Bound 

An 0(n 5 / 3 ) query upper bound can be obtained by using either Ambainis-Walk 
or Szegedy-Walk. The idea behind this is to perform a walk over r-subsets from 
the set of rows from A and another r-subsets of a set of columns from B, and the 
corresponding entries from C. For an n x n matrix M and an r-subset S of [n], 
let M\s denote a r x n sub- matrix of M corresponding to rows in S, M\ s an n x r 
sub-matrix of M corresponding to columns in S. Initially, we query r rows of A, r 
columns of B and r 2 entries of C corresponding to all these rows and columns. So 
the setup cost is 0(rn). When update, we swap in a new row for A, a new column 
for B and 2r entries of C, giving the update cost of 0(n). Checking is done by 
performing A\s x B\ T to see if it is equal to C\g for subsets S and T. Then the 
checking cost is 0. Here, we are looking for k = 2 elements, an index for a row in 
A and an index for a column in B that gives a wrong entry in C. The total cost if 
we use Ambainis-Walk is of the order of 

/fl \ 2/2 

rn + y—j (y/rn). 

Since we are looking for two elements that collide, e ~ ^ for r e o(n) and the 
spectral gap of the walk is K Then the query cost if we use Szegedy-Walk is of the 
order of 

TL 

rn H — T=(n). 
V r 

Here we see that both formulae give the same result, an 0(n 5//3 ) query upper 
bound when r = n 2 / 3 . 

Buhrman and Spalek [BS05] showed another Szegedy-Walk algorithm that uses 
random vectors to speed up the running time of the algorithm, the query complexity 
stays the same. In the original Szegedy-based algorithm described above, multiply- 
ing A\s with B\ T takes 0(nr 2 ) multiplications. This time can be reduced by using 
Freivalds' random vector technique on sub-matrices. At a setup stage, we multiply 
A\s with a vector u of length r and B\ T with another vector v of length r as well as 
computing uC\gV. During the walk stage we keep updating these three vectors. At 
the checking stage, the product of uA\s and B\ T v is tested against uC\gV. Then the 
setup cost is 2rn + r 2 = 0(rn), the update cost is In + 4r = 0(n) (a factor of two 
came from erasing and rewriting data), and the checking cost is 0(n). Note that we 
still need to query the same number of entries, i.e., 0(rn) entries, in the matrices 
as the original algorithm, and so the query complexity stays the same. Thus we 
focus on how much speed up there is in time complexity. The marked element is 
a pair of a row of A and a column of B such that when matrix A and B are 
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multiplied together via random vectors, it gives the incorrect entry of C at 
Note that since we are using random vectors, the fraction of marked elements and 
the fraction of elements that actually contribute to the product inequality, call them 
visible marked element are different. It can be shown, however, that the fraction 
of marked elements is close to the fraction of visible marked elements, and that we 
can minimize the error probability by calling this algorithm for a constant number 
of times, each time picking u and v randomly. Therefore, e ^ ^ for r G o(n). The 
eigenvalue gap 5 = £ as before, from Theorem 9. The time complexity of one run 
of the algorithm is 

n , . 
rn+—=(n), 
%/r 

which is 0(n 5 / 3 ) when r = n 2 / 3 . This algorithm is invoked for a constant number 
of times, hence the overall time complexity is also 0(n 5 / 3 ). Algorithm 3 describes 
the classical version of their algorithm. 

Algorithm 3 A Classical Algorithm for Testing If AB = C 

1: Create a random r-subset S of rows of A and another random r-subset T of 
columns of B. 

2: Pick a random 1 x r row vector u and a random r x 1 column vector v. 

3: Compute uA\s, B\ T v and uC\gV. 

4: while t < T do 

5: Swap one row of A and one column of B chosen u.a.r. 

6: Recompute uA\ s , B\ T v and uC\gV. 

7: Test if uA\ s x B\ T v = uC\ T s v. 

8: end while 

9: Answer 'AB=C" 



2.5.2 Lower Bound 

We use quantum adversary theorem to prove an Q(n 1 ' 5 ) lower bound [Amb05]. First 
consider a problem to test if Au = v, where A is an n x n matrix, u is a vector 
of length n with all the entries being 1, and v is a vector of length n with all the 
entries being n/2. Let a matrix A be balanced if each of its rows contains exactly 
n/2 entries that are 1 and exactly n/2 entries that are 0. Let unbalanced A to be 
such that n — 1 rows contain exactly n/2 entries of 1 but one row contains n/2 + 1 
entries of 1. Then for a balanced A, we have Au = v, but for an unbalanced A, 
we have Au ^ v. There are m = n{n/2) ways to transform a balanced matrix A 
into an unbalanced matrix by choosing one of n{n/2) entries that are 0. There are 
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m' = n/2 + l ways to transform an unbalanced A into a balanced A by choosing one 
of n/2 + 1 entries that are 1. The parameter / = 1 since balanced A and unbalanced 

A differs by exactly one position. Hence we obtain ^/ 1 n ( n / 2 )(™/ 2+1 ) _ Q^ 1 - 5 ) query 
lower bound for testing if Au = v. Let B consist of n entries of u in the columns 
and C to consist of n entries of v in the columns, then the above argument still 
holds, and so the lower bound for testing if AB = C is f2(n L5 ). 



Chapter 3 



Testing Commutativity of 
Matrices 

Suppose we have k matrices of dimension nxn. The entries of the matrix are given 
by an oracle with the input being a triplet and the output being the 

entry of Z-th matrix. We want to test if all the matrices in the set commute with 
each other or not by making as few queries to the oracle as possible. Classically, we 
need to query all the entries of the matrices by the following argument. Suppose 
all the matrices in the set contained all 1 entries. Then AB = BA for every pair. 
However, for every pair A, B, if we flip one of the kn 2 entries, say in matrix A, to 
then AB ^ BA for every other matrix B. Hence we have reduced the problem of 
unordered search among kn 2 items to testing commutativity, giving the lower bound 
of Q(kn 2 ). Quantumly, an unordered search of n elements takes Q(y/n) queries 
from Theorem 2.4.3 [Amb03], then by reduction, quantum query complexity of this 
problem is Vt{-\/kn 2 ). What would be the quantum query complexity of testing the 
commutativity of k matrices of size nxn? 

3.1 Commutativity Testing for a Single Pair 

Suppose we only want to test a single pair of matrices, that is to see if AB = BA for 
two nxn matrices A and B. The lower bound is obtained by the reduction from the 
unordered search as in at the beginning of Section 3 with k — 1. So quantum query 
lower bound is Q(n). The upper bound is obtained from a modification of matrix 
verification algorithm in [BS05]. When checking, instead of testing uA\s x B\ T v = 
uC\gV, we test uA\s x B\ T v = uB\s x A\ T v . This does not affect the overall time or 
query complexity of [BS05] in Section 2.5, and hence we have 0(n 5//3 ) upper bound 
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for testing AB = BA. 

3.2 Commutativity Testing of k Matrices 

Now let's take a look at the cases where we have k matrices to test the commutativ- 
ity. In presenting the quantum algorithms, we will describe the classical versions, 
as from Theorem 15, we only need to know the classical algorithm to bound the 
quantum complexity. 

3.2.1 Two Straightforward Algorithms 

The first algorithm performs a Grover search over all 0(k 2 ) pairs of matrices, at 
each step running a single pair commutativity testing algorithm that costs 0(n 5 / 3 ). 
Recall that the single pair commutativity testing algorithm in Section 3.1 was 
obtained from the modification of the bounded error matrix verification algorithm 
in Section 2.5. Then we have a bounded-error oracle. However, using the Theorem 
of [HMdW03] in Section 1.2.6, we can perform a quantum search with a bounded- 
error oracle with the same complexity as that with a perfect oracle. Hence, the 
query complexity of this algorithm is 0(kn 5 ^ 3 ). 

In the second algorithm, Algorithm 4 presented in the table below, we query 
fewer number of matrices by querying more entries per matrix. In order to estimate 

Algorithm 4 A Classical Version of the Second Straightforward Algorithm 

1: Create a random subset of r matrices. 

2: Query all the entries of the matrices in the subset. 

3: while t < T do 

4: Pick a matrix to be swapped u.a.r. from the subset and swap this with the 
one not in the subset also picked u.a.r. 

5: For the new matrix in the subset, query all the entries. 

6: Check if all the matrices in the subset commute or not. 

7: if There is a non commutative pair in the subset then 

8: print ''Non commutative .' ' 

9: return 
10: end if 
11: end while 
12: Answer "Commutative" 



the query, but not time complexity, we need to calculate the setup cost, update and 
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checking cost, and T the number of iterations as in Section 2.1.5. The setup cost 
is rn 2 by querying all the entries of r matrices in the subset. The update cost is n 2 
because we only need to query all the entries for the new matrix we swap into the 
subset. The checking cost is 0. T — -4= because from Theorem 15, T = -j= and 

( (fc : 2) ) 2 

5 — £ from Theorem 9 and e = ^ ~ fa for r G o(n), because we are looking for 
two matrices that does not commute. Applying these costs into Equation 2.4, 

rn H = m ). 

Optimizing this, we have r = /c 2 / 3 and hence the query complexity is 0{k 2 ^n 2 ). 

Notice that we could also think of this problem as element distinctness. Suppose 
that each element is a matrix, then we have a collision if two matrices do not 
commute. Since element distinctness can be solved in 0(k 2 ^) and we need to 
query each of 0(n 2 ) entries of the pair of matrices in question, this gives 0(k 2 ^ 3 n 2 ) 
query complexity. 

It is interesting to realize that although we could get the query upper bound 
using Szegedy-Walk, we could simply apply a Grover's search with a single pair 
matrix verification algorithm for the first algorithm, and element distinctness for 
the second algorithm. It seems we have not yet taken an advantage of Szegedy-walk. 



3.2.2 Walk Over Separate Rows and Columns 

The first straightforward algorithm repeatedly performs a walk over a set of rows of 
matrices. What if we walk over the rows and columns taken from all k matrices put 
together? Algorithm 5 describes the classical version of the walk. This algorithm 
keeps two different r-subsets, one for rows and one for columns. An element of 
r-subset for rows consists of (i, /), an i-th row of Z-th matrix, also denoted M^. An 
element of r-subset for columns consists of (j, m), a j-th column of m-th matrix, also 
denoted M j ' m . This is because we are looking for a pair of matrices (7, m) and pairs 
of rows and columns that do not commute i.e., M i: i x M j ' m ^ M i:tn x M j ' 1 , and 
so we need to separate all the rows and columns in different matrices. At each step 
of the walk, we pick one row and one column in the r-subsets and those not in the 
r-subsets u.a.r. and then swap these and update the data registers accordingly. At 
the checking step, the algorithm checks to see if there are rows % and columns j from 
two different matrices A and B. If so, we check the commutativity by multiplying 
the i-th row of A with j-th column of B, and see if it agrees with the product of 
i-th row of B with j-th column of A. 
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Algorithm 5 A Classical Walk Over Separate Rows and Columns 
1: Create an r-subset of rows by randomly choosing r rows among all the rows in 

k matrices. Similarly create another r-subset of columns. 
2: Query all the entries of the rows and columns in the subset. 
3: while t < T do 

4: Pick a row and a column u.a.r. from the r-subsets, and another row and 

column not in the r-subsets and swap these. 
5: For the new row and column in the subset, query all the entries. 
6: Check if there are rows i and columns j from two matrices A and B. If so, 
check if the product of row i of matrix A with the column j of matrix B is 
the same as that of row i of matrix B and the column j of matrix A. 
7: if There is a non commutative pair in the subset then 
8: print <( Non commutative .' ' 
9: return 
10: end if 
11: end while 
12: Answer "Commutative" 



The setup cost is 0{rn) because we have r rows and r columns in the subsets. 
The update cost is 0(n), because we need to query one row and one column. 
The checking cost is 0. We have two walks going on over row indices and column 
indices, each of a subset of size r. Then each walk operator has an eigenvalue 
gap of at least -, with Ai = 1, A2 < 1 — -• Since the eigenvalues of a tensor 
product of two matrices are the products of all the pairs of eigenvalues from the 
matrices, the largest eigenvalue is still 1-1 = 1 and the second largest eigenvalue 
is at most 1 • £ = K Hence the eigenvalue gap of the tensor product of the two 
matrices is 5 > -. The probability of having marked elements is the probability that 
we have noncommutative rows from two noncommutative matrices in the subset 
of rows times the probability that we have noncommutative columns from two 



noncommutative matrices in the subset of columns. Hence e = 




for r G o(nk). Hence our query complexity is 



rn + 




r 3/2 



Optimizing this gives 0(k i ^ 5 n 9 ^ 5 ) for r = /c 4//5 n 4//5 when r = o(nk). 

Note that when k — n, The first two straightforward algorithms both give n 8//3 , 
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and Algorithm 5 gives 0(n 13 / 5 ), hence Algorithm 5 has a better query complex- 
ity. However, when k < n 2//3 , the first straightforward algorithm in Section 3.2.1 
performs the best and when k > n 3 / 2 , Algorithm 4 performs the best. 

3.2.3 Simultaneous Quantum Walk 

Recall that in the first straightforward algorithm we repeatedly performed a walk 
over rows and columns of a fixed pair of matrices but no walk was performed 
over the matrices. In Algorithm 4, we performed a walk over matrices, but no 
walk was performed over the rows. What if we perform a walk over matrices and 
rows/columns at the same time? This is what Algorithm 6 does. The quantization 
of Algorithm 6 gives us another 0(/c 4//5 n 9//5 ) upper bound. Note that it has the 
same query complexity as that of Algorithm 5 from the previous section. 

Algorithm 6 A Classical Simultaneous Walk 

1: Create an r-subset of matrices S, an s-subset of rows R, and another s-subset 
C of columns. 

2: Query all the entries of the rows and columns in R and C of the matrices in 

the subset S. 
3: while t < T do 

4: Swap one matrix in the subset S with the one not in the subset chosen u.a.r. 
5: For the new matrix in the subset, query the s rows and columns in R and C. 
6: Swap one row and column in the subsets R and C with the ones not in the 

subsets both chosen u.a.r. 
7: For the new row and column in each of the matrices in the subset S, query 

all the entries. 

8: Check if all the sub matrices given by the subset commute or not. 

9: if There is a non commutative pair in the subset then 
10: print <( Non commutative .' ' 
11: return 
12: end if 
13: end while 
14: Answer "Commutative" 



In Algorithm 6, we maintain two different s-subsets for rows and columns. We 
keep all the rows and columns from all the matrices in the r-subset from the same 
set of row indices and column indices as the data. So the idea behind the algorithm 
is to keep updating the set of indices for matrices, rows, and columns. At each step 
of the walk, we get a new matrix and query the entries of this new matrix. Then 
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for each matrix in the r-subset, we update a row and a column. Then the setup 
cost is 0(rsn) for querying each entry of an s x n submatrix for each matrix in 
r-subset. The update cost is 0(rn + sn), O(sn) for a new matrix we just swapped 
in, and 0(rn) for a new row and a column for each matrix in r-subset. The checking 
cost is because checking is done by computing the product of submatrices whose 
entries we already know. We now calculate 5. Let P be the operator acting on 
matrix indices and Q = Q r <g> Q c be the operator acting on row and column indices. 
The eigenvalue gap for P is 1/r and for Q is l/s. Then 5 = min{l/r, 1/s}. The 

2 



k — 2\ \ / /n— 1 



probability of having noncommutative submatrices is e = ^ Kr ^ J J y ^ J for 
r G o(k) and s G o{n). Thus we have a total query cost of 

rsn H a/ max{r, s}(rn + sn). 

Since r G o(/c) and s G o(n), minimizing this gives 0(k^ 5 n 9 ^ 5 ) with r = s = 
k 2 / 5 n 2 / 5 when k 2 ^ <n< k 3 ^ 2 , 0(kn 2 ) with r = s = 1 otherwise. 

Note that walking for multiple steps before checking mixes the elements of 
subsets well without changing the eigenvalue gap. Then can we do better if the 
underlying classical Markov Chain is P u <8> Q v ', that is, perform u steps of the walk 
P over the matrices and then v steps of the walk Q over the rows/columns indices? 
It turns out that the increased cost of updating diminishes any gain from having 
the same eigenvalue gap. 

Theorem 3.2.1 Having M = P u <g> Q v for positive u and v as an underlying 
classical Markov Chain does not give any better query complexity than having M' = 
P®Q. 

Proof : We still have the same setup, the checking cost and e as before. So 

f( k z 2 )\ ( 7™i 1N )\ 2 

the setup cost is 0(rsn), the checking cost is and e = I yr ^ J J I y %^ J J for 

r G o(k) and s G o{n). The update cost this time is {usn + vrn). We need to 
analyze the eigenvalue gap of M — P U Q V . From Theorem 9, the upper bound of 
the eigenvalue gap is 1/r, hence the second largest eigenvalue is at least 1 — 1/r. 
Then the largest eigenvalue of P u is still 1 and its second largest eigenvalue is at 
least (1 — l/r) u . Similarly, the second largest eigenvalue of Q v is at least (1 — l/s) v . 
Then the largest eigenvalues for P U Q V is still 1 and the second largest is at most 
max{(l - l/r) u , (1 - l/s)"}. Then 5 > min{l - (1 - l/r) u , 1 - (1 - l/sf}. Then 
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we have 

T = -i- 



= — max 

rs 



Hence we have 



kn 

rsn + (usn + vrn) — max 



I . /I _ (1 _ I)« /i _ (i _ I), 



Next, we express r and s in terms of k and n that gives the optimal bound. 
We first note that (f — l/r) u «1 + u{—l/r) = 1 — u/r for r = uj{1) by taking 

the first two terms of binomial expansion. Hence \Jl — (1 — j) u \Jujr. Then 
we get the following bound for the cost, 



rsn + (usn + vrn) — max < — =, —= 
rs Ju Jv 



Suppose r/u > s/v, then r > su/v and vrn > usn. Then we get 

kn Jr- 
rsn + vrn 1=. 

rs y'U 

Simplifying this, we get 

kn 2 vJr 

rsn H =— . 

s^/u 

Both the first and the second terms of the sum above is an increasing function 
of r, so we want to set r to be the minimum. Since r > su/v, we set r = su/v. The 
new simplified formula is then, 

s 2 un kn 2 \Jv 

V y/s 

Since the first term of the sum above is an increasing function of s but the 
second term is a decreasing function of s, we set the first term to be equal to the 
second term, 

s 2 un kn 2 y/v 

V y/s 

Solving this gives s = fc2/jn ^^ 3/j ; anc i the query complexity is 0(k 4 ^n 9 ^ 5 v 1 / 5 -u 1 / 5 ) 
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for k 2 / 3 -^ < n < k 3 ! 2 "^-. Otherwise, we get r = s = 1 with complexity 0(kn 2 v). 
Similar arguments holds for when r/u < s/v. We see that since u and v are pos- 
itive, the best upper bound achieved by applying M = P U Q V does not give any 
better query bound than simply applying M' = PQ. □ 



3.3 Generalization of Simultaneous Quantum Walks 

In the previous problem of testing the commutativity of k matrices in Section 3.2, 
the marked state depended on two parameters, a set of matrix indices and the set 
of row/column indices. The best upper bound was obtained by a simultaneous walk 
over these two sets of indices. Suppose now the condition of being marked depends 
on m parameters. Then we can obtain a better upper bound than straightforward 
application of Grover's search or that of quantum walk by having a walk in each 
of m subsets in parallel, at each step of the walk, updating each of the parameters. 
For example, for the commutativity testing of a matrix set, m = 2 and so at each 
step, we updated a matrix set and a row/column set. The setup, the update and 
the checking cost, as well as e depends on how the data are stored. However, 5 is 
the minimum eigenvalue gap among all the walk operators. Hence if we have m 
subsets of size r\, r% . . . , r m , then 5 = minj{^-}. Below is an example problem that 
is reduced to testing the commutativity of k matrices problem by having only one 
element in each set. 

3.3.1 Example Problem 

Suppose we have m sets of matrices, each containing k matrices of size n x n. We 
are promised that within each set, the matrices commute. Are there two or more 
sets, when combined, give a noncommutative set of matrices? 

3.3.2 Upper Bound 

The following is an 0(m G ^ 7 k 6 ^ 7 n 13 ^ 7 ) algorithm by a simultaneous quantum walk 
over the sets, matrices and rows/columns. 

The idea is to form subsets of the set of matrices, matrix, and row/column and 
query all the entries corresponding to them at a setup stage. At each step of the 
walk, we swap a new set, a new matrix, and a new row/column and update the 
entries accordingly. The checking is done by computing the product of each pairs 
of matrices without any further query. See Algorithm 7 for details. Then, the setup 
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Algorithm 7 A Classical Algorithm for Solving Collisions with Three Parameters 

1: Create a t-subset S of sets, r-subset M of matrices and s-subsets R and C of 
rows/columns. 

2: Query all the entries of the rows and columns in R and C of matrices in M 

that are in sets S. 
3: while t < T do 

4: Swap one set in S with one not in S by choosing the elements u.a.r. 
5: Query s rows and columns in R and C for all the r matrices in M in the new 
t-subset. 

6: Swap one matrix in M with the one not in M both chosen u.a.r. 
7: Query s rows and columns in R and C for the new matrix in each of t sets 
in S. 

8: Swap one row and column in R and C with the ones not in R and C both 
chosen u.a.r. 

9: Query a row and a column for the new row and column in each of r matrices 

in M in t sets in S. 
10: Check if all the matrices in the subset commutes or not. 
11: if There is a non commutative pair in the subset then 
12: print <( Non commutative .' ' 
13: return 
14: end if 
15: end while 
16: Answer "Commutative" 
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cost is 0(trsn), because we need to query s rows for each of r matrices in each of 
t sets. The update cost is 0(rsn + tsn + rtn), rsn for when swapping sets, tsn for 
when swapping matrices, and 0{rtn) for when swapping rows/columns, e.g., for a 
new set, we need to query the entries of r matrices, and for each matrix, we keep 
s rows and columns. The checking cost is because we have already queried the 
entries of submatrices at the setup and the updating stages. The eigenvalue gap, 
5 = min{l/t, 1/r, l/s}, and e « ^2^2 for t G o(m), r G o(k), and s G o(n). Then 
our query complexity is 

trsn H —{rsn + tsn + rtn) 

for 5, e as stated above. By optimizing this, we get a cost of 0(m^ 7 k 6 / 7 n 13/7 ) 
with t = r = s = m 2 / 7 k 2 / 7 n 2 / 7 for k 5 ^ 2 n 5 ^ 2 < m, m 5 ^ 2 n 5 ^ 2 < k, and m 5//2 /c 5 / 2 < n. 
Oikmn 2 ) otherwise. On the other hand, if we perform a simple Grover's search by 
searching on a pair of sets and within each pair of set, a pair of noncommutative 
matrices, then it costs 0(mkn 5 ^ 3 ). Applying element distinctness over pairs of sets 
and within each pair, applying Grover's search over 0(k 2 ) pairs of matrices, and 
for each pair of matrices, applying a single pair commutativity testing algorithm in 
Section 3.1 gives 0(m 2 ^ 3 kn 5 ^ 3 ) query complexity. 

3.3.3 Lower Bound 

Q(m 1 / 2 k 1 / 2 n) lower bound is obtained by quantum adversary argument. 

Let A be the set such that m/2 sets contain pseudo-identity matrices, i.e., for 
1 < i < to/2, 1 < j < k, the j-th matrix in i-th set consists of diagonal entries 
of all ij. The other m/2 sets contain matrices with all the entries being the same 
and non-zero. For m/2 < i < m, the j-th matrix in i-th set contains all ij entries. 
Then within each of the m sets, the matrices commute with each other. Also all of 
mk matrices commute with each other. 

Let B be the set such that one of k matrices in one of m/2 sets that contain 
pseudo-identity matrices has one of off diagonal entries being flipped from zero to 
the same entry as in diagonal. Then within this set, the matrices still commute 
with each other because the rest of the k — 1 matrices are pseudo-identity. Within 
each of the other sets, the matrices still commute, because they are not affected. 
However, a set consists of the matrices from the modified set and the matrices from 
one of m/2 sets that contain all-same-entry matrices, gives non-commutative pairs, 
m = m/2kn 2 , m! = 1 and 1 = 1. So the lower bound is ^Jm/2kn 2 = Vlinr^^k^^n). 



Chapter 4 

Summary and Future Work 



We have seen two different kinds of quantum walk; Ambainis-Walk and Szegedy- 
Walk, which are tools for providing upper bounds for triangle finding problem and 
other matrix related problems. Both of the walks give the same query upper bound 
for matrix product verification. However, for triangle finding problem, Ambainis- 
Walk gives a better query upper bound. In fact, we have shown that with the same 
setup, update and checking cost for time or query complexity, Ambainis-Walk gives 
a better bound. On the other hand, Szegedy-Walk gives a better upper bound for 
time complexity in matrix verification problem. Moreover, there is an algorithm for 
testing commutativity of a general group [MN05], where analysis of Szegedy-walk 
is more powerful. 

Both of these walks are discrete in the sense that each time step of the walk is 
discrete. There is another kind of walk called continuous walk, where the walk is 
performed with a time step e where e — > 0. There is an application of continuous 
walk that gives an exponential separation in quantum query complexity [CCD + 03] 
from the classical counterpart. There is no exponential separation shown using 
discrete time walk so far, however. For some problem such as a search on N x N 
grid, discrete walk performs quadratically better than continuous walk without 
ancilla [AKR05]. Whether discrete walk is more powerful than continuous walk 
is an open question, although it is suspected that these give essentially the same 
behaviour. 

We have also seen Ambainis's quantum adversary theorem for proving lower 
bounds. This technique is used to prove a lower bound of Q(y/n) for a search on 
unstructured database. From this problem, we may derive lower bounds for many 
of the problems studied in this essay. 

For testing the commutativity of k matrices of size n x n, we learned that there 
are three query complexities 0(/cn 5//3 ), 0{k 2 ^n 2 ) and 0(A; 4 / 5 n 9 / 5 ) and depending 
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on the relationship between k and n, one upper bound is better than the others. 
The lower bound for this problem is Q.{k}l 2 n). 

For future work, we would like to classify what kinds of problems are better 
suited using Ambainis or Szegedy Walk. Also, we would like to come up with an 
upper bound for the matrix commutativity testing problem, that either supersedes 
or incorporates all the three upper bounds. Since the gap between the current 
upper bound and the lower bound is wide, we need to close the gap as well. We 
are not sure if quantum adversary method can prove a tight lower bound for this 
problem, and investigating other lower bound methods is also of interest. 
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